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Abstract 

An induced forest of a graph G is an acyclic induced subgraph of G. The 
present paper is devoted to the analysis of a simple randomised algorithm that 
grows an induced forest in a regular graph. The expected size of the forest it 
outputs provides a lower bound on the maximum number of vertices in an induced 
forest of G. When the girth is large and the degree is at least 4, our bound 
coincides with the best bound known to hold asymptotically almost surely for 
random regular graphs. This results in an alternative proof for the random case. 

1 Introduction 

An induced forest in a graph G is an acyclic induced subgraph of G. The problem of 
finding a large induced forest in a graph G has been a widely studied topic in graph 
theory, especially in its form known as the decycling set problem or the feedback vertex 
set problem. A decycling set of a graph is a subset of its vertices whose deletion yields 
an acyclic graph. From this definition, we deduce that a set S C V is such that G[S] is 
an induced forest of G = {V, E) if and only if V \ S is a decycling set of G. Therefore, 
finding a lower bound for t(G), the maximum number of vertices in an induced forest of 
G, amounts to finding an upper bound for <p(G), the minimum cardinality of a decycling 
set of G. 

Historically, the problem of obtaining an acyclic subgraph of a graph G by removing 
vertices was already considered by Kirchhoff in his work on spanning trees [IT] . Erdos 
et. al. also worked on this problem stated in terms of maximum induced trees in a graph 
[8]. However, finding a decycling set of a given size in a graph is inherently difficult. 
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Indeed, this problem has been shown to be NP-complete |10| . even for special families 
of graphs such as bipartite graphs, planar graphs or perfect graphs. 

On the other hand, there exist polynomial algorithms to solve instances of this 
problem in cubic graphs [13], permutation graphs [H] and interval graphs [I5J. Also, 
tighter bounds or even the exact value of the decycling number have been determined 
for graphs such as grids and cubes in [2] and [I]. 

For random regular graphs with fixed degree r, upper and lower bounds on the size 
of a minimum decycling set have been obtained by Bau et al. in [3]. Their strategy 
relies on the analysis of a randomised greedy algorithm that generates a decycling set 
of a regular graph as it is exposed in the usual pairing model of random regular graphs. 

We investigate induced forests in r-regular graphs with large girth, where r > 3 is a 
fixed integer. By the girth of a graph G, we mean the length of a shortest cycle in G, if 
the graph contains a cycle, or infinity, if it is acyclic. We will extend the method initiated 
by Lauer and the second author to find lower bounds on the size of largest independent 
sets [12] in such graphs. The proof involves analysing the performance of an iterative 
randomised algorithm that generates an independent set in a graph. Although their 
algorithm is applicable to any graph, the number of iterations allowed is bounded by 
a function that increases with the girth, and, because of this, better bounds can be 
obtained as the girth increases. We shall use a similar approach to obtain bounds on 
the size of an induced forest in a graph whose girth is large. 

More precisely, we will prove the following. 

Theorem 1.1 Let 5 > and r G N. Then, there exists g > such that every r-regular 
graph G on n vertices with girth greater than or equal to g satisfies t(G) > (£(?") — 
S)n, where the constants £(r) are derived from the solution of a system of differential 
equations. Numerical values are given in the table below for some values of r. 



r 




S(r) 


3 


0. 7268 


0.2732 


4 


0.6 04 5 


0.3955 


5 


0.5269 


0.4731 


6 


0.4711 


0.5289 


7 


0.4283 


0.5717 


8 


0.3940 


0.6060 


9 


0.3658 


0.6342 


10 


0.3419 


0.6581 



Table 1. Lower bounds on t(G) and, in the last column, upper bounds on <p(G), where 
G is an r-regular graph with sufficiently large girth. 

An actual formula for the constants £ (r) will be given in Section [6j 

Consider H(r) = 1 — £(r) in the above table. Then, for any fixed r > 3 and 5 > 0, 
(H(r) + S)n gives an upper bound on the number of vertices in a minimum decycling set 
of an r-regular graph G with girth greater than or equal to the positive integer g referred 



2 



to in the theorem. For all values of r tested, with the exception of r = 3, these are the 
best bounds known for regular graphs with large girth. For r = 3, it can be shown that 
t(G) = 0.75 for every 3-regular graph with sufficiently large girth as a consequence of a 
result on fragment ability of graphs [7]. 

Observe that, if G is a graph with maximum degree r, then we can create an r- regular 
graph by taking copies of G and joining some pairs of vertices from different copies so 
as to make the resulting graph G' r-regular. This can be done without decreasing 
the girth if sufficiently many copies of G are used. Moreover, we have the inequality 
T {G) > T {G') because the copy of G containing the most vertices in a largest induced 
forest in G' satisfies this property. Thus, the theorem immediately implies the following 
result. 

Corollary 1.1 Let 5 > and r G N. Then, there exists g > such that every graph 
G on n vertices with maximum degree r and girth greater than or equal to g satisfies 
t(G) > (£(r) — 8)n, where £(r) is given in Table 1 for some values of r. 

Furthermore, the second author [T7j (see also [16]) and Bollobas [5] independently 
proved results implying that, if G is a random r-regular graph on n vertices, and g is 
any positive integer, then G asymptotically almost surely has o(n) cycles of length at 
most g. (For a sequence of probability spaces fl n , n > 1, an event A n of fl n occurs 
asymptotically almost surely, or a.a.s. for brevity, if lxm n ^, 00 P(A n ) = 1.) So, G a.a.s. 
can be turned into a graph G' with maximum degree r and girth at least g by deleting 
o(n) of its vertices. By the previous corollary, given 5 > 0, we can find g > such that 
G' contains an induced forest with at least (£(r) — S)(n — o(n)) vertices. If we delete 
from the forest all the vertices adjacent to a vertex of V(G) \ V(G'), we have an induced 
forest of G with at least (£(r) — 5) (n — o(n)) — o(ri) vertices. This leads to the following 
result. 

Corollary 1.2 Let r G N and let G be a random r-regular graph on n vertices. Fix 
e > 0. Then a.a.s. G contains an induced forest with (£(r) — e)n vertices, where £(r) is 
the constant given in Table 1. 

For r > 4, these bounds coincide with the corrected version of the best bounds known 
for random regular graphs obtained in [3]. The need for correction arises from a fault 
in the latter part of the argument, which relied upon greedily growing an induced forest 
in a random regular graph. The number of uninvestigated edges leading out of the 
forest at time t was denoted Y(t). Differential equations were set up which describe 
the likely behaviour of Y(t), and it was shown that the actual behaviour is close to the 
likely behaviour a.a.s. The argument is valid as long as Y(t) > 0. Unfortunately, the 
equations were traced after that point, so the bounds quoted in [3J for r > 4 are invalid. 
It is quite easy to correct this. At the time Y(t) falls to 0, all vertices adjacent to the 
growing forest are adjacent to at least two vertices in the forest and cannot be added to 
it without creating a cycle, and the forest is actually a tree T. From the values of the 
variables at that point, it is easy to obtain the number of vertices not adjacent to any 
vertices in the tree, and also the number of edges in the subgraph H induced by them. 
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Since the average degree is less than and bounded away from 1, and H is uniformly 
distributed, given its degree sequence, it is easy to show that the number of vertices in 
cycles of H is a.a.s. small (say less than logn), and thus almost all vertices of H can be 
added to T to obtain an induced forest in the original graph. This argument gives, for 
the random graph, the same bounds as in Table 1. 

The main goal of this paper is to establish Theorem II. 1[ whose proof is structured 
as follows. We first introduce a randomised greedy algorithm that finds an induced 
forest of a given graph. As with the discussion above on random regular graphs, the 
final part of this algorithm adds almost all the vertices in a set of "leftover" vertices. 
When this algorithm is applied to an r-regular graph G with sufficiently large girth, its 
expected performance leads to the bounds in Table 1, and hence guarantees the existence 
of an induced forest on the same proportion of vertices by the first moment principle. 
To estimate the expected performance of the algorithm, we shall establish preliminary 
lemmas that help us understand the behaviour of our algorithm, which will then be 
used to derive a system of recurrence equations involving the cardinality of the set of 
vertices in the induced forest. Finally, we shall approximate this system of recurrence 
equations by a system of ordinary differential equations whose solution provides us with 
the bounds mentioned above. 

Our method also produces (weaker) bounds on t(G) if a specific lower bound on the 
girth of G is given. However, we do not compute the precise constants for any particular 
bound on girth. 

We remark here that the method initiated in [12] and developed further in this paper 
can clearly be applied and adapted to obtain a wide range of results on large sets of 
vertices or edges in bounded degree graphs with large girth. A particularly powerful 
extension which the authors are planning is to permit prioritisation of a number of 
alternative steps in the greedy algorithm. Such steps are used in the most powerful 
algorithms known for finding independent sets or dominating sets in random regular 
graphs; see [IB] and [5J. 

2 An algorithm 

We introduce an algorithm that will help us find a large induced forest in a graph 
G = (V,E). At any given step of the algorithm, we shall associate colours with the 
vertices of the graph as follows. The colour purple is assigned to vertices in a set P, a 
subset of V that induces a subgraph of G with "a few" cycles only. A vertex is blue 
if it is not yet in P, but could join it in the next iteration, whereas orange is assigned 
to vertices whose addition to P would yield cycles in G[P]. The remaining vertices are 
coloured white and are the vertices not adjacent to vertices of the forest. 

Algorithm 2.1 

Input: A graph G, a positive integer N and a pair of probabilities (po,p). 

1. Start with all the vertices of the graph coloured white. In the first step, colour each 
vertex purple with probability p , at random, independently of all others. Non- 
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purple vertices are coloured blue if they have exactly one purple neighbour and 
orange if they have at least two purple neighbours. 

2. At each step i, choose blue vertices randomly and independently with probability 
p and colour them purple. The sets of blue and orange vertices are updated using 
the rule given in 1. We refer to the set of white vertices as W , to the set of blue 
vertices as B and to the set of purple vertices as P. Repeat this iteratively for N 
steps. 

3. Create a set P C P by deleting any pair of adjacent vertices added to P in a same 
step. 

Output: The acyclic set P and the set of white vertices W . 

In the first phase, the roots of the induced trees are chosen and coloured purple, and 
vertices that could be added to the trees without creating cycles or connecting distinct 
components are coloured blue. In each step of the second phase, the forest is extended 
by choosing blue vertices and adding them to P, and at each step the colours associated 
with each vertex are updated so that the sets of white, blue and orange vertices at the 
end of each step represent the vertices with 0, 1, and more than one, purple neighbours, 
respectively. Note that it would be possible to alter p at each step, and this would be 
useful if optimising the algorithm for the set of graphs with particular girth (as done 
in [12] for independent sets), but we do not do this here. 

The graph G[P] at the end of Phase 2 is not necessarily acyclic. As a matter of fact, 
it may happen that two neighbouring blue vertices are added to the forest in the same 
step and create a cycle. So, the set of purple vertices induces a subgraph with "a few" 
cycles, and the set of orange vertices is "almost" a decycling set of the graph. These 
cycles are broken in the third phase of the algorithm. 

A drawback to the analysis of Algorithm [2J] in its original version is that the random 
selection of vertices at a given step depends on the outcome of the previous steps. To 
avoid this, we introduce an equivalent model for which the random choices are uniform 
over the whole set of vertices. Indeed, with each vertex v G V, we shall first associate a 
random sequence of labels S(v) C {0, 1, 2, . . .} so that label i is in S(v) independently 
at random with probability po, if i = 0, or p, if i > 1. In other words, we choose sets of 
vertices at times 0, 1, . . ., and assign to a vertex v the labels {i : v was chosen at time i}. 
In the context of our algorithm, we shall then consider the set of vertices with label 
to be the set of vertices selected in Phase 1 and use vertices with label i to recreate the 
set of vertices added to P at step i in Phase 2 of our algorithm. It is clear that some of 
the labels are ill-suited. For instance, a vertex with label 1 will not be selected to join 
P at step 1 if it also has label 0, in which case it already belongs to P, or if none of its 
neighbours has label 0, in which case it is not blue after the first phase of the algorithm. 
This motivates a classification of the labels as relevant or irrelevant, that is, as labels 
that represent an action of our algorithm or as labels that do not. 

Definition 2.1 Relevant and irrelevant labels 
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Let G = (V, E) be a graph, and, for every v G V , let S{v) C N be the set of labels 
associated with v. We define relevant labels inductively (labels that are not relevant are 
said to be irrelevant) . A label i is relevant for v if: 

I. i = G S(v), or 

II. i G S(v), j is irrelevant for v for all j < i, and there is a unique neighbour of v with 
a relevant label strictly smaller than i. 

The sets of vertices with relevant label equal to i are denoted by Ri, while the ones 
with relevant label less than or equal to i are denoted by and i?<j. We refer to the 
sequence [S(v) : v G V] as <S. Now, for each I G N, the sequence S may be used to 
construct a colouring of G with colours purple, blue, white and orange. 

Definition 2.2 Colouring of G at time I 

Given a graph G and a sequence S as above, the colouring of G at time I G N is the 
function assigning colours purple, blue, orange and white to the vertices of G defined as 
follows. Given u G V , 

(a) u is white if u ^ R<i and v ^ R<i, for all v G N(u), where N(u) denotes the 
neighbourhood of u. 

(b) u is blue if u £ R<i and there is a unique v G N(u) such that v G R<i- 

(c) u is orange if u R<i and there exist distinct v, w G N(u) with v, w G R<i. 

(d) u is purple if ' u G R<i. 

It is clear from this definition that the colouring of G at time / is fully determined 
by the sequence [S(v ) fl{0, . . . , 1} : v G V]. Moreover, this colouring coincides with the 
colouring of the graph induced by our algorithm if we assume the set P after k steps to 
be R<ki as formalised by the next lemma. 

Lemma 2.1 Let G = (V, E) be a graph, and consider a subgraph H ofG and a colouring 
c of H with colours purple, blue, orange and white. Then, the following events have the 
same probability: 

(i) the colouring of G at time I induced by the sequence S = [S(v) : v G ^(G)] 
restricted to H is equal to c, where S is obtained by adding each nonnegative 
integer i to S(v) independently with probability p , ifi — 0, orp, ifi>l, for all 
v G V. 

(ii) Algorithm \2.1\ applied to G obtains c as the colouring of H after step I. 

Proof We modify Phase 2 our algorithm to allow all vertices to be chosen uniformly at 
random with probability p, instead of restricting our choices to blue vertices. However, 
no action is taken if a non-blue vertex is selected. So, these extra "dummy" choices do 
not alter the probability of a given colouring of G being obtained and our result follows. 
I 

In the remainder of this paper, we shall work in the probability space of the sequence 
S of sets of labels. So, each time a colouring of graph G is mentioned, the colouring 
induced by S is meant. 
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3 Independence lemmas 



We prove results that allow us to compute the probability, using local information only, of 
a vertex of an r-regular graph G being assigned some given colour at time i. Henceforth, 
we shall fix an r-regular graph G = (V, E) with girth g and consider a sequence of sets 
S = [S(v) : v G V(Cr)], where i G N is in S(v) with probability p , if % — 0, or p, if 
i > 1, for all v . 

Lemma 3.1 Let G = (V,E) be a graph and consider a sequence of sets of labels S = 
[S(v) : v G V]. Given u G V , define a sequence of sets of labels S' by replacing, in S, 
S{u) by some set S'(u). Let w be a vertex of G whose colours at time % with respect to 
S and S' differ, where % is a nonnegative integer. 

Then, there exists a path V from u to w for which every vertex except possibly w 
gained or lost a relevant label less than or equal to i when S was replaced by S' . Moreover, 
the relevant labels gained or lost by each vertex along the path are in strictly increasing 
order when the path is considered from u to w. 

Proof The proof is by induction on i. For i = 0, since the colour of w at time has 
changed after replacing S(u) by S'(u), it must be that u has gained or lost relevant label 
and that either u = w or u and w are neighbours. In both cases, V = (u, w) satisfies 
the conditions in the statement of this lemma. 

Now, let i > and assume that this result holds earlier. If u = w nothing needs to be 
done, so suppose that this is not the case. Since the colour of w changed at time i, there 
exists a neighbour w' of w that gained or lost a relevant label smaller than or equal to i. 
If w' = u, our result is clearly true, so suppose that they are distinct. Then, the relevant 
label gained or lost by w' is not equal to and, by the definition of relevant label, there 
is a neighbour w" of w' that gained or lost a relevant label at a time j strictly smaller 
than the relevant label gained or lost by w' . In particular, the colour of w" changed at 
time j, so, by induction, there is a path V" from u to w" under the conditions of the 
lemma. Thus, the path V obtained by appending vertices w' and u to V" satisfies the 
required properties. I 

Corollary 3.1 Let u G V(G) andi G N. Then, for any given colour c and any collection 
of subsets S' v of N, where v ranges over the vertices at distance at least i + 2 of u, the 
event that u has colour c at time i is independent of the event that S{y) = S^. 

Proof It is sufficient to show that, if S = [S(v) : v G V(G)] is any given family of sets 
of labels and new sets S'(v) are assigned to each vertex v satisfying d(u, v) > i + 2, then 
the colour of u at time % relative to S is the same as the colour of u at time % relative to 
S', where S' is obtained by replacing each S(v) by S'(v). 

We now prove this sufficient condition. Suppose for a contradiction that the colours of 
u with respect to S and S' differ, and order the vertices v G V satisfying d(u, v) > i + 2 
as t>i,t>2, • • • ,v m . Consider, for I G {0, . . . , m}, the sequences Si obtained from S by 
replacing S(vi), . . . , S(vi) by S'(vi), . . . , S'(vi). Our assumption implies the existence of 
j such that the colours of u with respect to Sj and Sj + i are distinct. By Lemma [XT] 
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there is a path V in G from Vj + \ to u such that every vertex except possibly u gained 
or lost a relevant label less than or equal to i when Sj was replaced by Sj + x- Also, the 
relevant labels gained or lost on each vertex along the path are in strictly increasing 
order when the path is considered from vj + \ to u. In particular, V contains at most 
i + 2 vertices, i.e., d(u,Vj + i) < i + 1, a contradiction. I 

Let Bi and Wi denote the sets of vertices coloured blue and white at time i, respec- 
tively. 

Corollary 3.2 Let u G V and let v be one its neighbours. Then, the probabilities 
P(u G Wi), P(u G B^, P(w G WiAv G Wi), P(w G 5, An G Wi) andP(u G ^Av G 5,) 
are independent of u and v whenever 2i < g — 3. Moreover, if we let w be a neighbour 
of u distinct from v, P(v E Bi Au E Bi Aw E R<i) does not depend on u, v or w. 

Proof We know from Corollary 13. II that the colour of u at time i depends only on the 
sets of labels of vertices at distance at most % + 1 from u. In other words, u is fully 
determined by the sets of labels in the subgraph G u = G[{v : dc(u,v) < i + 1}]. But 
our restriction on i implies that, for every u G V, the graphs G u are isomorphic. Our 
first two claims immediately follow, since distinct vertices are assigned sets of labels 
independently with the same probability. It is clear that an analogous argument can be 
used to prove the remaining statements. I 

Lemma 3.2 Let u G V and let vi,...,v r be its neighbours. Fix i,k G N such that 
2(i + 2) < g — 2 and consider, for each j G {1, . . . , r}, the tree Tj rooted at Vj given by 
the component of G[{v : dc{u,v) < 2} — u] containing Vj. 
Then, the following assertions hold. 

1. Let Xi, . . . ,X r be colourings of the tree isomorphic to the rooted trees Tj (the iso- 
morphism is a consequence of our restriction on i, k). Then, conditional upon u G Wi, 
the events E\, . . . , E r are mutually independent, where Ej stands for the event that Tj 
has colouring Xj at time i. 

2. Conditional upon u G Bi and v\ G R<i for some G {1, . . . ,r} ; the same events Ej 
are mutually independent for all j ^ I. 

Proof By Lemma 13.11 the colour of a vertex w at time i is altered when replacing 
S%(v) X • • • X S r (v) by S[(v) X • • • X S' r (v) only if there is a path V from v to w such that 
all the vertices on V that are not purple at time i with respect to S have a different 
colour with respect to S'. This is because, given any non-purple vertex w' at time i lying 
on V, it either gains a relevant label, in which case it is purple at time i with respect to 
S', or it is equal to w, in which case its colour changes by assumption. 

Now, if w and v are vertices in different branches with respect to u, our restriction 
on i implies by Corollary 13.11 that any path from v to w that is short enough for every 
interior vertex to gain or lose a relevant label passes through u. Hence, conditional upon 
u being white, changes in S\{v) x • • • x S r (v) do not affect the colour of w at time i. 

Moreover, we also know by Corollary 13.11 that the colour at time i of vertices at 
distance at most two from u are not affected by changes in the set of labels of vertices 
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whose distance to u is greater that i + 3. Let V u ^k be the set of vertices in G at distance 
at most k + i + 1 from u, excluding vertex u. 
So, 

P(£x A E 2 A . . . A E r | w e W) = X) P (%) = 5 - Vw G ^A* I M G W & 

* 

where denotes the sum over vectors (S v : v G Kt,i,ifc) such that the event S'(u) = 
S v , Wv G Kt,i,fc, implies E\ A E 2 A . . . A E r . Now, observe that our restriction on i, 
implies that the trees are a n disjoint. In particular, we can first sum over sets 

of labels of vertices in r p^ k+%+1 ^ (notation X]**) an d then over the remaining vertices 
(notation XL**) to obtain 

P(E 1 AE 2 A...AE r \ueWi) = ^^P(S(t;) = S v ,Vv G V Uji>k \ u G Wi). 

** *** 

Using conditional probability and rearranging the sum, this becomes 

J2P(S(v) = S v ,\fv G if +m) | u G Wi)x 
** 

x ^P(S(v) = S v ,Vv i if | (u G W) A (S(v) = S v ,Vv G if -* +1 >)) 
*** 

= ^2P(S(v) = S v ,Vv G jf | u G Wt) P{E 2 A . . . A E r | u G W) 
** 

= P(Ei | u G H^) P(£ 2 A . . . A E r | W G Wi). 

These manipulations can be done since, conditional upon u being white, changes in S(v) 
do not affect the colours of other branches, for any v G j"^ k+l+1 \ 
Repeating this argument for the remaining branches, we obtain 

r 

P(E 1 AE 2 A...AE r \ueWi) = Y[ P(Ej I u G Wi), 

3=1 

and our first claim is true. 

For the second part, we proceed analogously by leaving both the blue vertex u and 
the branch of its neighbour with relevant label untouched, and then summing over all 
possibilities of labels for vertices in the other branches. I 

4 Applications of the Independence Lemmas 

In this section, the independence results of the previous section will be used to obtain 
recurrence equations relating the probabilities of events that are important in the anal- 
ysis of Algorithm 12.11 We introduce some notation. Let u be a vertex of graph G. An 
arbitrary neighbour of u will be denoted by v, while we use v%, . . . , v r to refer to the set 
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of neighbours of u. When u has a neighbour with relevant label, this will be referred as 
Vk and we shall assume that v ^ V}.. 

Furthermore, for any i > 0, we know by Corollary 13.21 that the quantities Wj = P(w 6 
Wi), k = P(u G B,), qi = P(u G Wi Av G Wi), = P(u G B { Av G Wi) and ^ = P(w G 
B, A?; G -Bi), or even P(u G 5j A f G Wj A t> k G -R<;) and P(w G B; A t> G B { A v k G -R<;), 
do not depend on u, v or /c. We now let i > 1 and establish the following consequences 
of the previous independence lemmas. 

Corollary 4.1 

(i) Let J = {j u . . .,j k } C {1, . . . ,r}. Then, 

P{ Vj £R h VjeJ\ue Wi„i) = Y[P(ij iRi\ue WU). 

(it,) Let J C {1, . . . ,r} \ {k}. Then, 

P(vj (£R h Vj G J \ ue A t>fc G i2<i_i) = 

]3 P ( v j £ #i I M G B i-i A Wfc G i?<i-i). 

Proof We prove part (i) by induction on k. For k — 1, the result follows immediately, 
so let > 1 and assume the result holds for any smaller set J. 

First observe that, because a vertex receives relevant label i > 1 only if it is blue at 
time i — 1, it is important to consider the set of blue neighbours of u at time i — 1. In 
light of this, we associate a vector w G with the set of neighbours Vj t of it so that 
uj(t) = 1 if and only if Vj t G 

Note that, for a vertex not to become purple at time i, it either was not blue at the 
previous step or it was blue, but i is not contained in its set of labels. Thus, 

P( Vj i R h Vj G J | u G 

= P((v it G B,_i A i £ S(y jt ),Vt with w(t) = 1) A 

A (v jt £ Bi- h Vt with u(t) = 0) | u G 

The fact that contains any nonnegative integer independently at random (and label 
i does not influence the colouring at time i — together with Lemma [3T2l ensures that 
the events of the form (vj t G -Bj_i Ai £ S(vj t )) and Vj u £ -Bj_i are mutually independent 
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conditional upon u being white. So, the equation becomes 
P( Vj <£Ri,VjeJ\ue W^ x ) 

= Y1 II p (^* G B *-! A 1 $ s ( v ^ I u G Wi ~J x 

{jf-u>(t) = l} 

x n p ( u * ^ i m g ^-O- 

{j t :u;(t)=0} 

= P(t; jfc G A ^ £ S(v jk ) | u G Wi_x) II P ^ ^ I u G ^-0 X 

w'ez*- 1 O't:<*"(t)=0} 

x n p ^ g a * ^ s ^ i m g + 

{j«:«'(t)=l} 

+ p( % ^ i U g ^ n p ^* ^ i U g ^-i) x 

X JJ P(u jt G A i £ S(v jt ) | u G 

{*:w'(t)=l} 

= P(t; jfc G A-i A ^ £ 5(^J | u G ^_!)P(^ £ i^, VjeJ\{k}\ue + 
+ P(u jjk £ I u G ^_i)P(^- £ ifc, V? G J \ {A;} | u G W^) 

By induction, this is equal to 

P(v jk G A i £ 5(^J | u G JJ P(vj <£Ri\ue A f G i2<i_i) + 

jeJ\{fc} 

+ P(u jfc £ | u G Yl P ( v i i R i I u G A u * G ^<i-i) 

as required for (i). 

An analogous argument gives (ii). I 

Remark 4.1 This corollary can also be extended to conditioning upon u G Wj-i A 
f G where u,v are neighbours in G (or any other combination of restrictions on 

u,v being white or blue). As a matter of fact, if u±, . . . ,u r _i, i>i, . . . , iV-i denote the 
neighbours of u, v distinct from u and v, and J,K C {l,...,r — 1} ; i/ien 

P((u,- i R h Vj G J) A ( Ufc i Ri, Vk G X) I u G Wi_x A f G Wi_x) 

= n p ^ ^ ^ i u g n p (^ ^ R i\ ve w ^)- 

j£j k&K 

This can be obtained by expanding the initial probability into a sum over vectors uj G 
Z2 J ' + ' X ' and then using the fact that, for any event E, we have 

, rrr „r X P (E A 1i G I V G Wj-l) 

P(u G | v G 
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so that Lemma \3. S\ can be applied first with respect to u G Wi-! and then with respect 
to v £ Wi^i. It is clear that similar results can be stated by conditioning upon other 
combinations of u and v being white or blue. 

Corollary 4.2 

(i) P(ueW t | u e Wi-!) = (l ''""^ 



Wi-! 

r-1 



(a) p( u e B t | u e wu) = 1 1 - ^ 

Wi-! V w i-l 

Proof For (i), we just observe that, for u to cease to be white at time i, at least one of 
its neighbours has relevant neighbour i. Thus, 

P(u eWi\ue W^) = P( Vj i Rt, Vj\ue W^). 

Now, by Corollary 14.11 part (i), this last expression is equal to 



Y[P(vj <£Ri\ueW, 



i-i> 



Finally, Corollary 13.21 guarantees that the probability of Vj having relevant label i is 
independent of Vj and equals the probability of the event that i G S(vj) and Vj is 
coloured blue at time i — 1. So, 

P{ Vj eRi\ue W^) = 

Wi-! 

and 

(\ r 
l _PSi-l 
Wi-! 

as a consequence. 

Assertion (ii) may be proven using a similar approach. I 

Corollary 4.3 P(u G B % \ u G B^) = (1 - p) (l - rpt '~* 

Proof The fact that u is blue at step i — 1 implies that exactly one of its neighbours 
v !, . . . , v r has a relevant label less than or equal to i — 1. Thus, 

r 

P(u G Bi | u G Bi-!) =J]P(ii6 Bi \ u G B^AVk G R<i-!)P{v k G i?<*_i | u G B^). 

k=l 

Moreover, u remains blue at time % if neither itself nor any of its neighbours gains a 
relevant label at time i, i.e., 

P(u G Bi | u G Bi_!Av k G R<i-i) = (l-p)P(v j i Ri, Vj ^ k \ u G B^Av,, G i2<i_i). 
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By Corollary 14.11 part (ii), we obtain 

P(vj £Ri, Vj ^k\ue Bi-x Av k E R<i-i) 

= Y[~P(vj (£ \ u e Bi-x Av k e R<i-i) 

= (1 - pP(vj G | u e S_i Av k e J R< i _i)) r " 1 . 

The last equality follows from the fact that v G R4 only if it has label i and was blue at 
time i — 1. 

Finally, we note that 

P(Vj G it G A v k G i?<i_i) = J —— r-= 

P(u G Bi-i Av k e R<i-x) 

= P(vj e Bj_x Aue Bj_x)P{vk G R<j-x I Vj G Sj_i Ah 6 S_i) 
P(u G Si_i)P(u fe G ,R<i_i I u G A-i) 



By Corollary 13.21 we conclude that all the neighbours of u have the same probability of 
having a relevant label earlier than the other neighbours, since the probability of having 
relevant label i is equal to po, if i = 0, or if i > 1, for any vertex. In particular, we 

must have P(t> fe G S<i_i | w G S^) = \ and P(u fe G R<i-x \ Vj G B^xAu G Si_ x ) = 
So, 

r- 1 



P(u g Si I « g - (1 - p) V PK e R<i-i I « e Si-0 ( i - 7 ^A, 1 1 

= (l-p) (l 



fc=i 



(r - 1)6,-1 

with the last equation following from Ylk=i^( Vk e R<i-i I M £ Sj_i) = 1. This 
concludes the proof. I[ Nick June '07: Punctuation added, format changed: ] 
[ Carlos June '07: Some of the items were followed by commas, some by semi-colons. 
I replaced the two semi-colons by commas. ] 



Corollary 4.4 

(i) P{u G Wi A v G Wi | u G Wi-x Ave Wi-x) = ( 1 



2r-2 

PSi-X 



Wi-x 

2r-3 



(ii) P{u G Wi A v G ^ | u G A t; G = (r f 1 - PS<_1 



u>i_i V Wi-x 



/\ rpti_i 



fr-DVs 2 / nc. . n 2 

(Hi) P(u G Si A v G Si I it G A f G = , t-1 1 ' ' ' 

w 

(iv) P(u eWi Av G Bi | 116 AvE B { - X 

= (l-p) (1- 
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(v) P(u e BiAv e Bi I u e W-i a v e B 



(r - l)p(l -p)s i - 1 ( _ ysj-x \ 2 L rpti-i x 



Wi-i V Wi-1 J \ ( r - l)&t-l 

(vi) P(u E Bi A v E Bi | it G Ave B^) = (1 - p) 2 f 1 rp * i_1 



(r-l)6i-i. 

Proof Let «i, . . . , u r -i be the neighbours of u other than v and v±, . . . , iy_i be the 
neighbours of v distinct from u. Then, 

P(u eWiAveWi | u g Wi-i Ave W-i) 

= P{ui, . . . , u r -i, v x ,..., u r _i ^ -Rj I u G Ave W-i) 



r— 1 r— 1 



PSi-1 



k=l k=l 
2r-2 

1 "' S 1 ^ 



This is based on the remark after Corollary 14.11 

A similar strategy leads to the other formulae. I 



5 Differential Equations 

Using the expressions calculated in the last section, we can now determine recursive 
formulae for the variables introduced for the analysis of our algorithm. 

1. Formula for wf 

Wi = P(u E Wi) = P(u E Wi A u E Wi-i) 

PSi-l 



= P{u E Wi_i)P(u G Wi | U E Wi-x) = tWi-! 1 

V Wi-l 

2. Formula for bf. 

bi = P(u E Bi) = P(u E BiAuE B^i) + P(u E B t AuE W t -i) 

= P(u E Si-i)P(w EBi\uE + P(u E Wi_i)P(« E Bi \ u E W-i) 

= -p)(i- ^%ht 1 + f 1 - — r • 

3. Formula for 

9i = P( u G Wi A v G Wi) 

= P(u E W-i Af £ W-i)P(m e W A v E Wi I u G W-i A u G W-i) 

\ 2r-2 

1 

V Wi-i 
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4. Formula for s, : 



Si = P(u e Bi Av e Wi) 
= P(u e Bi_! Ave W^_i)P(w e BiAv eWi \ue B { _ x Ave 
+ P(u e Ave Wi_i)P(u eBiAt/e^lwe W { - x Av e W { - x ) 



(r - l)ggj-i£i-i A _ ps^_i 



2r-3 

5. Formula for t^: 

ti = P(u e BiAv e Bi) 

= P(u e A i> G Sj_i)P(u e Bi A v e Bi \ u e B^ Ave + 
+ P(u G A f G Wi_i)P(u G ^ A u G B< | u G A v G + 
+ P(v e Wi-! Ave Bi-i)P(u e BiAv e Bi\ue Ave B^) + 
+ P(v e Wi^ x Aue Wi_i)P(« G -Bj A f G -Bj I m G A u G 

=^(i-P) 2 fi-^^V r " 4 + 



1)6, 



i-1 



+ 2*^(1 - p) ( 1 - ? ^ "I ^ (r ~ ^ ( 1 - ™=iV" 2 + 



Wi-i \ Wi-! 

We need to evaluate wo, bo, qo, so and to to have the necessary initial conditions 
for solving the system of recurrence equations found above. It is easy to see that 
w = P(u e Wo) = (1 — Po) r+1 an d b = To(l — PoY, since for the former neither u nor 
its neighbours can have relevant label 0, and for the latter u cannot have relevant label 
0, but exactly one of its neighbours must have it. 

Now, 

q = P( u e Wo A v e Wo) = (l - Po ) 2r , 

since the event v G W Aue W is equivalent to neither u, v nor any of their other 
neighbours being chosen in the first phase of the algorithm (and each vertex is chosen 
independently with probability po). 
The equation for sq is given by 



2r- 1 



s = P(u e B A u e Wo) = (r - l)p (l - Po) 

because v e Bo Au e Wo occurs when u, v are not chosen, no neighbours of u are chosen 
and precisely one neighbour of v is chosen. 
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Finally, the equation for t is 

to = P(« e B A v G S ) = (r - l) 2 pg(l - Po) 2r ~ 2 

with similar justification. 

The recurrence equation for to, obtained at the beginning of this section can be seen 

as 

Wi = Wi-i - prs^i + 0(p 2 ). 

For p small, the term 0(p 2 ) should only have a minor influence. Similarly, each of the 
other equations of the system of recurrence equations can be rewritten as a main term 
added to a term of the order of p 2 . By ignoring the latter, we obtain the following 
auxiliary system of recurrence equations: 

w\ = w[_ x - prs'i-i 

V i = V i - 1 +p{-V i - 1 -rt' i _ 1 + rs' i _ 1 ) 

, , (2r - ^-A-x 
Qi = Qi-i - P ; 

, , ( / , i r - l )i-i s 'i-i 

*, = «,-i+p(^-i + ^ 

(r-l)^-! 2 r(r-2)^-! X ^ 

<-i (r - m~i 

2(r-l)sl 1 2 2r(r-2)tU 2 

<-i ( r - l ) h 'i-l 

rpo(l -PoY, q' = (1 -Po) 2r , 
s' = (r- l)po(l - Po ) 2r -\ t' Q = (r- 1)V Q (1 - p ) 2r " 2 

Note that the auxiliary system of recurrence equations (15.1 j) can be converted into 
a system of differential equations by means of first order approximations. Setting p = e 
in the recurrence equation for w[ obtained above implies 

w'i-w'i-i = -ers'ir-i, 
so that, for e small, we are interested in w, s satisfying the differential equation 

dw 

— — = —rs. 

ax 

Applying the same argument to the other recurrence formulae in (15.11) . the following 
system of differential equations arises. This system will be referred to as the system of 
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differential equations associated with (r,p ). 



-rs 



dw 
dx 

s 

dx 

dq _(2r-2)qs 
dx w 



-b — ri + rs 



ds (r — l)qs (r — l)s r(r — 2)st (5.2) 

dx w w ( r — 1)5 

di « 2(r - l)s 2 2r(r - 2)t 2 

— = -2t + — t-!- i ir- 

dx w ( r _ \)b 

w(0) = (l-p ) r+ \ 6(0) = rpo(l -p ) r , q(0) = (1 -p ) 2r , 
1(0) = (r - l)po(l -po) 2 - 1 , f(0) = (r - l) 2 p 2 (l - po)^" 2 . 

Given p G (0, 1), T > and 7 > 0, where 7 < min{w , 60, <?o, sq, ^0}, this system of 
differential equations has a solution in the domain ^(7, T) = {(x, w, b, q, s, t) e (—7, T) x 
(7, l) 2 x (7, l) 3 } which may be uniquely extended arbitrarily close to the boundary of 
the domain, by a standard result in the theory of first order differential equations (see 
Hurewicz [9], Chapter 2, Theorem 11). 

As expected, there is a connection between the original system of recurrence equa- 
tions and the system of differential equations (|5.2p . This connection is summarised 
in the lemma below and follows from the solutions to the original system being well- 
approximated by the solutions of the modified system (15.11) . as well as from the relation 
between the solutions of (15. ip and of (I5.2p given by Euler's method. The proof is routine 
so is omitted. 

Lemma 5.1 Let r > 3 be an integer and po G (0, 1). Let ko > such that the system of 
differential equations A5.2\) with the initial conditions defined by po has positive solutions 
in Q defined at x = k$. Then, given £ > 0, 

(i) there exists €q > satisfying the following property. I/O < e < eo and the system of 
recurrence equations ( f5.jp is solved withp = e, then \wi—w(ei)\ < £ ; \b— b(ei)\ < £ ; 
\qi - q(ei)\ < £, \si - s(ei)\ < £ and \U - t(ei)\ < for i = 0, 1, . . . , \k /e] . 

(ii) there exists e\ > such that for < e < e\, 

nk \k /e]-l 



/ b(x) dx — ebi 
Jo i=0 



< £, /or every < e < ei. 



Using this lemma, we can now determine additional properties of the solutions to 

a. 
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Lemma 5.2 Given po G (0, 1), the system of differential equations A5.2\) has unique 
solutions w(x), b(x), q{x), s(x) and t(x) defined over the entire nonnegative real line 
satisfying the following properties: 

(i) w{x), b(x), q{x), s(x) and i(x) are positive, 

POO 

(ii) / b(x) dx converges. 
Jo 

Proof As mentioned before, a standard result in the theory of first order differential 
equations ensures that, for p Q G (0,1), T > and 7 < mm{w , b , q , s , t }, the sys- 
tem of differential equations has a solution in the domain f2(7,T) = {(x, w, b, q, s, t) G 
(— 7, T) x (7, l) 2 x (7, l) 3 } which may be uniquely extended arbitrarily close to the 
boundary of the domain. 

Given po G (0, 1) and T > 0, we show that there exists 7 = 7(T) > such that 
this system of differential equations in the domain ^(7, T) has a unique solution defined 
for x arbitrarily close to x — T. This implies that the solutions are defined over the 
nonnegative real line. 

Suppose on the contrary that, for some T > 0, no j(T) with the above property 
exists. Let Xq denote the infimum of such T. Let (x', w', b', q', s', if) be any point in the 
interior of a region 12 (70, xo) such that a solution to the system of differential equations 
exists for < x < x' and w{x') = w', b(x') = b', q(x') = q', s(x') = s' and t(x') = t', 
where 70 > 0. By Lemma |5TT| given £ > 0, there exists eo such that for < e < eo and 
< i < \x'je\ , 

\wi - w(ei)\ < £, \bi - b(ei)\ < £, \qi - q(ei)\ < £, \si - s(ei)\ < £, \U - t(ei)\ < £. 

Since the quantities Wi, bi, qi, s» and ti represent probabilities of specific events after i 
steps of a randomised algorithm, we conclude that 

qi + Si < Wi, Si + U < bi. 

Using this and the fact that w' , b' , q' , s' , t' > 70 (which is independent of £), we have 

max{g(x'), s(x')} < w(x'), max{s(x'), t(x')} < b(x'). (5.3) 

Let m be a positive integer such that l/m < min{w , 6 , q , s , t Q }. The definition 
of Xo implies that one of the functions w, b, q, s, t must get arbitrarily close to in the 
neighbourhood of a point Xo, < xo < T. By (|5.3p . it must be one of q, s or t. (Note 
that this argument also applies in the case that x — 0.) 

Suppose this is the case for q. Let x' be such that the system of differential equations 
have a positive solution in [0,x']. Recall that 

dq (2r - 2)qs _ „ ( ~(2r - 2)§\ 
dx w \ w J ' 
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and, by equation (15.31) . — (2r — 2)s(x)/w(x) > — (2r — 2) for < x < x'. Now, if / is 
the solution for 

^ = -(2r-2)/, /(0)=g(0), 

we must have q(x) > f(x) for every x in the interval [0, x'\. However, fix) = f(0)e~^ 2r ~ 2 ^ x 
is a strictly positive function in this interval bounded below by the constant f(0)e~^ 2r ~ 2 ^ x . 
So q(x) cannot approach at x' . Similar arguments yield contradictions for the cases 
when s{x) or t(x) approach in the neighbourhood of the point x , since 

ds f (r — l)q (r — l)s r(r — 2)t 
dx I w w {j- — 1)6 

r(r — 2) 



> -1 - (r - 1) 



r — 1 



and 



dt ^ 2(r - l)s 2 2r(r - 2)t 2 
u) ( r - 1)6 

> (_ 2 _ *±=m i 

Thus, the solutions to the system of differential equations are indeed defined over 
the entire nonnegative real line. Furthermore, the previous argument ensures that they 
are positive, concluding the proof of part (i). 

For part (ii), note that the differential equations for w and 6 in (15.21) imply 

d(w + 6) . ; 
; = — 6 — rt, 

dx 

so 

f , . d(w + b) , . , d(w + b) , , 

bix) = ^ -(x) - rtix) < ^ J -(x), Var. 

dx dx 

As a consequence, for every T > 0, 

b(x)dx< [ - d{w + h ^ ( x ) dx = w(0) + 6(0) - w(T) - 6(T) < w(0) + 6(0). 
This proves part (ii). I 



6 Proof of Theorem 1.1 



As mentioned in the introduction, we wish to obtain a lower bound on the cardinality 
of a largest vertex subset that induces a forest in an r-regular graph G not containing 
short cycles. Recall our definition of r(G), given by 

t{G) = max {|V(F)| : F is an induced forest in G}. 
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Let G be an r-regular graph on n vertices with girth g and consider the set P of 
purple vertices at the end of step 2 when Algorithm l2.1l is applied to G with N < g/2 — 2. 
It is clear that the induced graph G[P] contains a cycle only if some vertex v with at 
least two purple neighbours has been added to P. By the description of our algorithm, 
this cannot happen unless v was selected in the same step as one of its neighbours. It 
follows that, if P is the set obtained from P by deleting any pairs of adjacent vertices 
added to P in the same step, the induced subgraph G[P] is acyclic. 

Now, given a vertex in the probability that none of its neighbours is also selected 
is at least (1 — p) r , since a vertex has at most r neighbours that could be added to B4. 
Therefore, the expected number of vertices added to P at time i that are not removed 
is at least p(l — p) T hi_\n and 

N 

E|P| >p (l-Po) r n + X)K 1 -lO'Vi« (6- 1 ) 

Part of the set W of white vertices produced at the end of the algorithm will also be 
added to the forest. By definition, these vertices have no purple neighbours, so that no 
cycle containing purple vertices is created by adding white vertices to P. Thus G[PUlf] 
is still acyclic, where W denotes the set of vertices in acyclic components of Cr[W]. 

Now, since G has girth g, no cycles appear if we add white vertices lying in compo- 
nents of Cr[W] of size at most g — 1. Therefore, a lower bound on the size of W can be 
obtained by estimating the number of vertices in small components of G[W]. This will 
be done through a branching process argument. 

To define the branching process, start with a white vertex t> and set the random 
variable Y = {v Q }. In general, Y$ denotes the set of white vertices already exposed, but 
whose neighbours have not been considered yet. Define U = V(G) — {v } and let C/j be 
the random variable accounting for the set of vertices which have not been exposed by 
the branching process up to step i. After step i, either \Yi\ = 0, in which case the process 
has died out, or \Y{\ > 0, in which case we choose a white vertex V{ in Yj, expose its 
white neighbours N w {yi) C U i and define Y i+1 = Yj U N w {vi) — {fj}, U i+ i — Ui\ N(vi). 
We are interested in estimating the probability that |Yg-i| > 0, i.e., that the branching 
process has not died out after g — 1 steps. 

Proposition 6.1 Let 5 > 0, fix an integer r > 3 and suppose the existence of po > 
such that the solutions to the system of differential equations associated with {r,po) 
(r — 

satisfy lim — — < 1. Then, there exist g > 0, < N < g/2 — 1 and < p < 1 

x^co W[X) 

such that, if Algorithm \2.1\ is applied to an r-regular graph G with girth at least g for N 
steps with probabilities (po,p), then 

p(in/2_ii > 0) < s. 

Proof Let Zj denote the random variable counting the number of neighbours of Vi 
in C/j. Note that Z has binomial distribution Bin(r, Qn/wn), since Corollary 13.21 and 
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Lemma I3~2l ensure that, conditional upon v$ being white, the events associated with each 
of its neighbours being white are mutually independent and have probability q^/w^. 
Furthermore, Zi has distribution Bin(r — 1, Qn/wn) for every z > 1, since the condition 
< % < g — 1 implies \N(vi) (1 Ui\ = r — 1, and Corollary 13.21 and Lemma 13.21 are 
applicable in the same way. 

Let k > such that the solution to the system of differential equations (I5.2p satisfies 
(r - l)q(k )/w(k ) < 1. 

( r _ l\(q(k ) + £) 

Let £ > be such that - — r < 1. Fix eo as in Lemma [57TI part (z), and 

w(ko) - i 

let e < eo such that A^ = fco/e is an integer. Now, apply Algorithm 12.11 for A" steps with 
the given po and p = e, for all z > 1, to a graph G with girth ^ > 2A^ + 3. Then, 

(r ~ l)q N < (r-l)(g(fcb)+Q x 



w N ~ w(A; ) - £ 

So, we have (r — l)q N /wN < 1, and a branching process argument as in [1] shows that, 
by choosing g sufficiently large, P(|F 9 / 2 -i| > 0) < 5, as required. I 

By the above proposition, given 5 > and po > such that the solutions to the 

(t — l)^(x) 

system of differential equations associated with (r,p Q ) satisfy lim — — < 1, we 

x^oo w(x) 

may fix g, N and p so as to have the property P(Y g /2-i > 0) < 5, i.e., P(Y g /2-i = 
0) > 1 — 8. It follows that for such g the expected number of white vertices in acyclic 
components of G[W] is bounded below by 

(l-5)w N n. (6.2) 

We are now ready to prove Theorem 11.11 

Proof of Theorem 11.11 Fix r G N and 5 > 0. We show that, given p E (0, 1), the 
inequality r(G) > (£(po) — $)n holds, where 

- (r - l)q(x) 

y ™ ™ W (x) 

Po(l — Po) r + / b(x) dx, otherwise. 

Here, w, b and g are solutions to the system of differential equations associated with 
(r,po). By Lemma 15.21 this system has positive solutions w,b,q,s,t defined over the 

POO 

nonnegative real line such that / b(x) dx converges. 

Jo 

Let k > be such that, for every k > ko, 



b(x) dx — I b(x) dx 



Using Lemma [5. 1[ fix eo > such that 



\wi - w{ei)\ < -, z = 0, 1, 
6 



< \ (6.3) 



ko 
e 
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and fix ei > satisfying 

ffe 



PKO a I e I 

/ b(x) dx — ebi 
Jo 



i=0 



< -, for every < e < ei. 
6 



Let e = min{e , e 1; 1 - (1 - 5/6) 1/r } and JV = \k /e] . Fix # = 2N + 3. Then, given 
an r-regular graph G with girth larger than or equal to g, we apply Algorithm [2U] for N 
steps with probabilities (po,P = e). The first moment principle leads to a lower bound 
for t{G). As a matter of fact, our lower bound (16. ip on the cardinality of P implies 



E|P| > npo(l - PoT + n(l - e) r {j^ eb^A 



> np (l -p ) r + n(l - e 

> np (l -p ) r + n ( 1 - 

> n (p (l -Po) r + 



b{x) dx — 



6 

b{x) dx - 

u 6 

b(x) dx ' 



(6.4) 



If, in addition, the solutions to the system of differential equations associated with 
(f — l)(^(x) 

(r,po) satisfy lim — — < 1, Proposition 16.11 establishes a lower bound (\6.2\\ on 

x->oo w(x) 

the cardinality of the set W of white vertices that can be added to the forest. Clearly, 
k in (16. 3p may be chosen so that, for every k > k , we also have 



lim w(x) — w(k) 

X — >OD 



< 



and 



(r — l)q(k) 
w(k) 



< 1. 



The girth g can also be taken larger, if necessary, to ensure that the size of W is bounded 
below by (1 — 5/6)wNn. 
Thus, 



E\W\ > n ( 1 - ^ j w N > n ^1 - ^ ) w(eN) 
> // | 1 ] lim w{x) > n lim w(.r) 

6 / x^oo x— >oo 



5 n 
~2~' 



(6.5) 



Now, given that t(G) > E|P U W| and using equations f]6.4f) and (16. 5j) . we conclude 



that 



t(G) > (e(p„) - 5) n, 
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as claimed. Numerical calculations of these quantities lead us to the bounds in Table 1. 
We note that, for every value of r tested, we were able to choose a constant p such that 
the numerical solutions to the system of differential equations associated with (r,p ) 

satisfy hm t^lM < 1. ■ 

x~*oo W{X) 
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